From Document to Entity Retrieval: Improving Precision and Performance of Focused Text Search
نویسنده
چکیده
PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Universiteit Twente, op gezag van de rector magnificus prof. dr. to my grandfather who should have gotten a PhD long before me vi Acknowledgments Writing a thesis that sums up my scientific work of four years was a new experience for me. First of all it asked quite some patience from myself. Instead of looking forward to new scientific challenges, it forced me to re-read, rethink , and rewrite what I had done before. The confrontation with the past brought up old ideas, scientific plans, things I did as well as things I never found the time to do. And, last but not least, it made me think of all the people that accompanied me through that period and made it an exciting, enjoyable time. First, I'd like to thank my supervisor Djoerd for all his detailed reviewing work on this thesis and on my other scientific writing, which improved the presentation " by far ". But also for the nice working atmosphere we had during the whole period of my PhD, and for just being around for all kinds of questions and discussions starting on work issues but not always ending there. There have been many more people though who contributed to this research work. My promoter Peter, who always tried to keep me on track, and without him I would probably not have finished my PhD in time. who did an excellent job in reviewing my scientific work. All those people gave many fruitful input to my own work, and at the same time teached me to defend my own writing. I also want to thank the database group at the UT for the good working environment and the friendly atmosphere; our soup cooperation for providing at least the remembrance of a warm lunch. To pick out a few people: It was Maurice who had the brilliant idea to ask me whether I would like to come to the Netherlands at a time when I was not really thinking of doing a PhD. Developing our own search system PF/Tijah would not have been that vii viii successful and fun without our scientific programmer Jan, who helped me a lot with my code work when he was not climbing mountains at the remotest places of the world. Further, Sandra, Ida, and Suse could hardly have done more to support …
منابع مشابه
Improving Precision of Keywords Extracted From Persian Text Using Word2Vec Algorithm
Keywords can present the main concepts of the text without human intervention according to the model. Keywords are important vocabulary words that describe the text and play a very important role in accurate and fast understanding of the content. The purpose of extracting keywords is to identify the subject of the text and the main content of the text in the shortest time. Keyword extraction pl...
متن کاملIIT TREC 2006: Genomics Track
For the TREC-2006 Genomics Track, we report on the effectiveness of composite information retrieval functions based on a dimensional data model for improving document, passage, and aspect search precision of genomics literature. We designed an approach, and developed a corresponding search engine, based on a novel dimensional data model capable of document, paragraph, sentence, and passage leve...
متن کاملThe State-of-the-arts in Focused Search
The continuous influx of various text data on the Web requires search engines to improve their retrieval abilities for more specific information. The need for relevant results to a user’s topic of interest has gone beyond search for domain or type specific documents to more focused result (e.g. document fragments or answers to a query). The introduction of XML provides a format standard for dat...
متن کاملDocument Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملExperiments with Geographic Evidence Extracted from Documents
For the 2008 participation at GeoCLEF, we focused on improving the extraction of geographic signatures from documents and optimising their use for GIR. The results show that the detection of explicit geographic named entities for including their terms in a tuned weighted index field significantly improves retrieval performance when compared to classic text retrieval.
متن کاملMultimodal Medical Image Retrieval: Improving Precision at ImageCLEF 2009
We present results from Oregon Health & Science University’s participation in the medical retrieval task of ImageCLEF 2009. This year, we focused on improving retrieval performance, especially early precision, in the task of solving medical multimodal queries. These queries contain visual data, given as a set of image-examples, and textual data, provided as a set of words belonging to three dim...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008